Data-Driven Learning in an Incremental Grammar Framework

Authors

  • Matthew Purver
  • Arash Eshghi
  • Julian Hough
Abstract

Incremental processing of both syntax and semantics, in parsing as well as in generation, is of significant interest for modelling the human language capability and for building systems that interact with it. Formal linguistics has made significant contributions here; one example is Dynamic Syntax, an inherently word-by-word incremental grammatical framework. However, making such a framework practical for computational models or systems requires grammars with broad coverage on real data, which is a significant challenge. Here, we describe a method for inducing such a grammar from a corpus in which sentences are paired with semantic logical forms. Taking a probabilistic view, we hypothesise possible lexical entries, including entries for anaphoric elements, and learn a lexicon from their observed distributions without requiring annotation at the word level. The resulting grammar provides a resource for incremental semantic processing with good coverage, while learning grammatical constraints similar to those of a hand-crafted version.
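The idea of learning a lexicon from the observed distributions of hypothesised entries can be illustrated with a toy sketch. This is not the authors' induction procedure: the function name `estimate_lexicon`, the string labels used for entries, and the choice to split each word occurrence's unit of evidence evenly among its candidates are all assumptions made for illustration. The only point shown is that word-level annotation is not needed, because ambiguity within one sentence can be resolved by evidence accumulated across sentences.

```python
from collections import Counter, defaultdict


def estimate_lexicon(observations):
    """Estimate P(entry | word) from hypothesised candidate entries.

    `observations` is a list of sentences; each sentence is a list of
    (word, candidates) pairs, where `candidates` holds the hypothetical
    lexical entries compatible with that sentence's logical form.
    Assumption: each word occurrence contributes one unit of evidence,
    split evenly among its candidates (a simplification).
    """
    counts = defaultdict(Counter)
    for sentence in observations:
        for word, candidates in sentence:
            if not candidates:
                continue  # no hypothesis survived for this occurrence
            weight = 1.0 / len(candidates)
            for entry in candidates:
                counts[word][entry] += weight

    # Normalise the fractional counts into a distribution per word.
    lexicon = {}
    for word, entry_counts in counts.items():
        total = sum(entry_counts.values())
        lexicon[word] = {e: c / total for e, c in entry_counts.items()}
    return lexicon


# Hypothetical toy data: "runs" is ambiguous in the first sentence
# but unambiguous in the second, so the shared candidate gains mass.
observations = [
    [("john", ["e:john"]), ("runs", ["e->t:run", "t:run"])],
    [("mary", ["e:mary"]), ("runs", ["e->t:run"])],
]
lexicon = estimate_lexicon(observations)
```

In this toy data the candidate that is consistent with both sentences ends up with the larger share of the probability mass for "runs", which is the kind of distributional evidence the abstract refers to.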


Similar resources

A Hybrid Framework for Building an Efficient Incremental Intrusion Detection System

In this paper, a boosting-based incremental hybrid intrusion detection system is introduced. This system combines incremental misuse detection and incremental anomaly detection. We use a boosting ensemble of weak classifiers to implement the misuse intrusion detection system. It can identify new classes of intrusions that do not exist in the training dataset for incremental misuse detection. As...


Inducing lexical entries for an incremental semantic grammar

We introduce a method for data-driven learning of lexical entries in an inherently incremental semantic grammar formalism, Dynamic Syntax (DS). Lexical actions in DS are constrained procedures for the incremental projection of compositional semantic structure. Here, we show how these can be induced directly from sentences paired with their complete propositional semantic structures. Checking in...


Incremental Grammar Induction from Child-Directed Dialogue Utterances

We describe a method for learning an incremental semantic grammar from data in which utterances are paired with logical forms representing their meaning. Working in an inherently incremental framework, Dynamic Syntax, we show how words can be associated with probabilistic procedures for the incremental projection of meaning, providing a grammar which can be used directly in incremental probabil...


Probabilistic induction for an incremental semantic grammar

We describe a method for learning an incremental semantic grammar from a corpus in which sentences are paired with logical forms as predicate-argument structure trees. Working in the framework of Dynamic Syntax, and assuming a set of generally available compositional mechanisms, we show how lexical entries can be learned as probabilistic procedures for the incremental projection of semantic str...


Learning Unification-based Grammars Using the Spoken English Corpus

This paper describes a grammar learning system that combines model-based and data-driven learning within a single framework. Our results from learning grammars using the Spoken English Corpus (SEC) suggest that combined model-based and data-driven learning can produce a more plausible grammar than is the case when using either learning style in isolation.




Publication date: 2013